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Abstract 

We study the problem of maximizing sum rates in a Gaussian interference- 
limited channel that models multiuser communication in a CDMA wireless 
network or DSL cable binder. Using tools from nonnegative irreducible matrix 
theory, in particular the Perron-Frobenius Theorem and the Friedland-Karlin 
inequalities, we provide insights into the structural property of optimal power 
allocation strategies that maximize sum rates. Our approach is similar to the 
treatment of linear models in mathematical economies, where interference is 
viewed in the context of competition. We show that this maximum problem 
can be restated as a maximization problem of a convex function on a closed con- 
vex set. We suggest three algorithms to find the exact and approximate values 
of the optimal sum rates. In particular, our algorithms exploit the eigenspace of 
specially crafted nonnegative interference matrices, which, with the use of stan- 
dard optimization tools, can provide useful upper bounds and feasible solutions 
to the nonconvex problem. 

Keywords and phrases: Sum rates, maximization of convex functions, spec- 
tral radii of irreducible matrices, linear programming, wireless networks. 
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1 Introduction 



Interference control is a major and fundamental issue in the design of a multiuser 
communication system, where interference is a central phenomenon. The goal of 
an interference-limited system is to develop a resource allocation scheme that is 
able to jointly optimize the performance of all users in the presence of mutual 
interference. To meet the growing demand of untethered applications, transmission 
rate adaptation by power control has emerged as active research issues in multiuser 
systems, which often impact the design of higher networking layers. This is due to 
innovations at the link layer and the need for utility maximization over networks 
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[19, 6], where a system- wide performance objective has to be maximized or the 
quality of service for each individual user has to be met. 

Power control is often used in multiuser communication system such as cellular, 
ad-hoc networks and digital subscriber lines (DSL) to provide a high signal-to-noise 
(SNR) ratio to each user for a reliable connection. A higher SNR allows a system with 
link adaptation to transmit at a higher data rate, thus leading to a greater spectral 
efficiency. However, in a multiuser system, the performance of each user depends 
not only on its power allocation, but also on the power allocation of other users. 
There is thus a tradeoff among the data rates of all users, when a single system- wide 
performance metric or network-wide utility has to be optimized. Intelligent power 
control is necessary to jointly maximize the overall performance and mitigate the 
interference of all users. 

The motivation of this work comes from maximizing sum rate or data throughput 
in wireless communication channels or DSL. Due to the broadcast nature of radio 
transmission, data rates in a wireless network are affected by multiuser interference. 
This is particularly true in Code Division Multiple Access (CDMA) systems, where 
users transmit at the same time over the same frequency bands and their spreading 
codes are not perfectly orthogonal. Transmit power control is often used to control 
signal interference to maximize the total transmission rates of all users. In a DSL 
broadband access system, multiple users share a cable binder and communicate 
with a central office through a common spectrum [16, 24]. Signals bundled in the 
close proximity of a cable binder interfere with one another due to electromagnetic 
coupling between neighboring twisted-pairs. In this paper, we focus primarily on 
sum rate maximization in a wireless communication channel over a single frequency, 
but we will also show how our results can be easily extended to find application to 
optimal spectrum management over multiple frequencies as in a DSL channel. 

From an information theoretic viewpoint, finding the capacity region and opti- 
mal code design of the interference channel is still an open problem even for the 
simple case of two users. A practical approach that leads to an inner bound of 
the unknown capacity region or achievable rate region is to adopt a noncoopera- 
tive approach that treats interference as additive Gaussian noise, i.e., no multiuser 
detection or interference cancellation is used. The Gaussian assumption on the in- 
terference distribution is justified, especially when the number of interfering users is 
large, e.g., in existing DSL cable binder, there are around fifty to hundred twisted 
pairs in a cable binder. Such an approach is also relevant in practical systems, be- 
cause the signaling overhead involved in coordination is minimal (leading to simple 
channel access protocols without a centralized coordinator and the use of single user 
codes) and transceiver techniques with low complexity can be employed at each user. 

The rate region characterizes all possible data rate combinations among all users 
subject to power constraints while making the decoding error probability arbitrary 
small. Obtaining the optimal transmit power control scheme that maximizes sum 
rates requires solving a nonconvex problem [20, 5, 9, 15, 24, 4]. The computational 
aspect of an exhaustive search over all possible power allocations is very high. As the 
number of users increases, it becomes hard to obtain optimal solutions in polynomial 
time. While global optimality in power control allocation is highly desired, it is 
often of necessity to find local optimal solution with performance guarantees in the 
presence of nonconvexities. However, weighted sum rate is not only difficult to 
optimize, but the problem may even be hard to approximate [16]. 
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In this paper, using tools from nonnegative irreducible matrix theory, in partic- 
ular the Perron-Frobenius Theorem and the Friedland-Karlin inequalities, we pro- 
vide insights into the structural property of optimal power allocation strategies that 
maximize sum rates. Our approach is similar to the treatment of linear models 
in mathematical economies, where interference is viewed in the context of competi- 
tion. In particular, we show that we can convert the problem of maximizing sum rate 
to a problem of maximizing a convex function on a closed unbounded convex set. 
This approach is based on construction of interference matrices that characterize 
the problem, which is similar to the role of input-output matrices in mathematical 
economies. More importantly, it leads to the design of new algorithms for finding 
the exact and approximate solutions to the maximum problem. By focusing on fast 
and simple algorithms, interference among multiple users in a common spectrum 
can be mitigated so that efficient data multiplexing can be realized. 

We now state briefly the maximal problem. (See §2 for all definitions, notations 
and motivations.) Let F = [fij\i = j = i, v = (v±, . . . ,vl) t be an L x L matrix, with 
zero diagonal and positive off diagonal elements, and a positive vector, respectively. 
Define the following transformation 7(p) = (71 (p), • • • , 7l(p)) T from the set of 
nonnegative vectors p <G to itself: 

Pi 

7i(p) = — z , for I = 1,. . . ,L, where p = (p u . . . ,pf) > 0. 

2^m=l JlmPm + V t 

Let p = (pi, . . . ,Pl) t be a given positive vector. For a given probability vector 
w = (wi, . . . ,w L ) T , let 

L 

$w(7) :=5^Wilog(l + 7i), where w, 7 G R£. (1.1) 

i=i 

Then the sum rate maximization problem in Gaussian interference-limited channels 
is 

max $( 7 (P)). (1.2) 

0<Pi<P; VZ 

The exact solution to this problem is known to be (strongly) NP-hard [16]. An often 
used classical technique to tackle nonconvexity is the Lagrange duality. The short- 
coming of using this approach is however that there can exist a duality gap between 
the global optimal primal and dual solution of (1.2) [16]. Also, finding an optimal 
primal solution given an optimal dual solution, or vice versa, is in general very dif- 
ficult. In contrast to simply resorting to classical global optimization techniques, 
we will highlight in this paper the insightful role of using irreducible nonnegative 
matrix theory to tackle nonconvexity of the form given by (1.2). 

For any vector 7 = (71, ... , 7x) T G M L let = (e^ 1 , . . . , e* L ) . The purpose 
of this paper to show that the sum rate maximization problem equivalent to the 
maximization problem of the convex function <I> w (e" 1 '), where 7 varies on a closed 
unbounded convex domain D({F} C K L ). (See (4.6).) Note that this formulation 
shows that we can not expect, in general, that a maximal solution p* is unique. 
We show how to approximate this domain by a convex polytope D(C\, ■ ■ ■ , Cm> ^0> 
formed mainly by the supporting hyperplanes of D({F}). This approximation is 
obtained by using the Friedland-Karlin inequality stated in [10]. Hence, it is easier 
to find the maximum < I>(e" ) ') on D(Q 1 , . . . , £ M , K), than on D({F}). Finally we 
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approximate the function $(e^) by the linear function w T 7 = YliLi w ili- Then the 
maximum of w T 7 on -D(Ci> • • • , Cm-i K) 1S a linear programming problem, which can 
be solved in polynomial time using the ellipsoid algorithm [12]. 

We survey briefly the contents of the paper. In §2 we state definitions, notations 
and a short motivation. We give a characterization of the image of the multidimen- 
sional box [0, p] C R+ by the map 7, in terms of certain inequalities of the spectral 
radii of corresponding nonnegative matrices. In §3 we study the sum rate maxi- 
mization problem in wireless channel. We give necessary and sufficient conditions 
for an extremal point p G [0, p] of 3> w (7(p)) to be a local maximum. In §4 we study 
several relaxation versions of the sum rate maximization problem. These relaxation 
version are simpler than the original maximal problem. When replacing the maxi- 
mal function $ w by the linear function w T 7, we have a closed-form solution under 
certain conditions on w. This solution is obtained by using the Friedland-Karlin 
inequality stated in [10]. In §5 we give three algorithms to solve the sum rate max- 
imization problem and its two other approximation versions described above. In §6 
we apply our results to sum rate maximization in DSL channel. We conclude the 
paper in §7. In §A, viewed as an appendix, we restate some useful results for [10] 
and give several applications and extensions, which are needed in this paper. 



2 Notations and preliminary results 

As usual, let R mxn d R™ xri denote the set of m x n matrices and its subset of 
nonnegative matrices. For A,B G R mxn , we denote A < B \i B - A e M™ xn . 
We denote A<B,A<B if B — A is a nonzero nonnegative and positive matrix, 
respectively. We denote the entries of a matrix A G R mxn by the small letters, i.e 
A = [aij]$2i- Identify R m = R mxl ,R™ = M™ xl . 

A column vector is denoted by the bold letter x = (x\, . . . ,x m ) T G R m . Then 
e x ._ ^ e a;i^ _ _ _ ^ e Xm ) T . For x > 0, we let x -1 := (^-,...,^-) T and logx = 

(logxi, . . . , logxi) T . For any x = (xi, . . . , xl) t G R l , we let e x = (e xi , . . . , e XL ) T . 
Let xoy denote the Schur product of the vectors x and y, i.e., xoy = [x±yi, . . . ,xlUl] T 
Let 1 = (1, ... , 1) T G R L . For p < p G R L , denote by [p, p] the set of all x G R L 
satisfying p < x < p. For a vector y = (y±, . . . ,Vl) T , denote by diag(y) the di- 
agonal matrix diag(yi, . . . , yi). We also let (-By); denote the Ith element of By. 
The Perron- Frobenius eigenvalue of a nonnegative matrix F is denoted as p(F), and 
the Perron (right) and left eigenvector of F associated with p(F) are denoted by 
x(F) and y(F) (or simply x and y when the context is clear) respectively. Assume 
that F is a nonnegative irreducible matrix. Then p(F) is simple and positive, and 
x(F),y(F) > 0. We will assume the normalization: x(F) o y(F) is a probability 
vector. The super-script (•) denotes transpose. For a positive integer n, denote by 
(n) the set {1, . . . , n}. Let P : X — > Y be a mapping from the space X to the space 
Y. For a subset Z C X, we denote by P{Z) the image of the set Z. 

Consider a wireless interference channel, e.g., cellular network, with L logical 
transmitter /receiver pairs. In the cellular uplink case, all logical receivers may 
reside in the same physical receiver, i.e., the base station. The interference- limited 
channel with L users can be modeled as a Gaussian interference channel given by 



4 



the baseband signal model: 



y t = huXi+^hijXj + zi, (2.1) 

where yi € C lxl is the received signal of the Zth user, h\j G C lxl is the channel 
coefficient between the transmitter of the jth user and the receiver of the Ith user, 
x G (£ Nxl i s the transmitted (information carrying) signal vector, and z{s are the 
i.i.d. additive complex Gaussian noise coefficient with variance ni/2 on each of 
its real and imaginary components. The first term on the right-hand side of (2.1) 
represents the desired signal, whereas the second term represents the interfering 
signals from other users. At each transmitter, the signal is constrained by an average 
power constraint, i.e., E[|a^| 2 ] = pi, which we assume to be upper bounded by pi for 
all I. 

The vector (pi,... ,pl) is the transmit power vector and is the parameter of 
interest in this paper. Let G = [gij][j =1 > OlxL represents the channel gain, where 
9ij = \hij\ 2 is the channel gain from the jth transmitter to the Ith receiver, and 
n = (m, . . . ,ul) t > 0, where n; is the noise power at the Zth receiver. Assuming 
a linear matched-filter receiver, the Signal-to- Interference Ratio (SIR) for the Ith 
receiver is denoted by 7/. 

For p = (pi, . . . ,pl) T > 0, we define the following transformation: p 1— > 7(p), 
where 

7 Kp) := ^ 9UPl - , 1 = 1,. ..,L, 7(P) = (71(P),...,7L(P)) T . (2.2) 
l^j^i 9ijPj + ni 



Define 



F = where f tj = [m t \ + ) ( 2 - 3 ) 



and 



m n 2 n L \ 



T 



g = (9u,---,9ll) , n = (m,...,n L ) ,v= — , — . (2.4) 

522 9llJ 

Then 

7 (p) = P o(Fp + v)- 1 . (2.5) 

Claim 2.1 Let p > be a nonnegative vector. Assume that 7(p) is defined by 
(2.2). Then p(diag(7(p))F) < 1, where F is defined by (2.3). Hence, for 7 = 7(p), 

p = P( 7 ) := (I - diag( 7 )F)- 1 diag( 7 )v. (2.6) 

Vice versa, if 7 is in the set 

T := {7 > 0, p(diag( 7 )F) < 1}, (2.7) 

then the vector p defined by (2.6) is nonnegative. Furthermore, 7(P(p)) = 7. That 
is, 7 : — > r, and P :T — > are inverse mappings. 
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Proof. Observe that (2.2) is equivalent to the equality 

p = diag( 7 )Fp + diag( 7 )v. (2.8) 

Assume first that p is a positive vector, i.e., p > 0. Hence, 7(p) > 0. Since all off- 
diagonal entries of F are positive it follows that the matrix diag( 7 )F is irreducible. 
As v > 0, we deduce that max i6 [ ljn ] ^'^(^p)' < \ xhe minmax characterization 
of Wielandt of p(diag( 7 )F), [23] and [11, (38), pp.64], implies /)(diag( 7 )F) < 1. 
Hence, 7 (p) € T. Assume now that p > 0. Note that pi > <^=^ 7 i(p) > 0. So 
p = <^=> 7(p) = 0. Clearly, p( 7 (0)F) = p(0lxl) = < 1. Assume now that 
p > 0. Let A = {i : Pi > 0}. Denote 7 (p)(*4) the vector composed of positive 
entries of 7 (p). Let F(A) be the principal submatrix of F with rows and columns in 
A. It is straightforward to see that p(diag( 7 (p))F) = p( K di&g(j( K p)( K A)F(A)). The 
arguments above imply that 

p(diag( 7 (p))F) = p(diag( 7 (p)(^)F(^)) < 1. 

Assume now that 7 <G T. Then 

oo 

(7 - diag^TT 1 = ]T(diag( 7 )F) fc > LxL . (2.9) 

fc=0 

Hence, 7- > ( 7 ) > 0. The definition of P( 7 ) implies that 7 (P( 7 )) = 7 . □ 

Claim 2.2 The set T C is monotonic with respect to the order >. That is 
if*/ G r and 7 > (3 > then f3 € T. Furthermore, the function P( 7 ) is monotone 
on r. 

P{l) > P(P) iff G T and 7 > (3 > 0. (2.10) 
Equality holds if and only if 7 = (3. 

Proof. Clearly, if 7 > (3 > then diag( 7 )F > diag(/3)F which implies p(diag( 7 )F) > 
/?(diag(/3)7 1 ). Hence, T is monotonic. Use the Neumann expansion (2.9) to deduce 
the monotonicity of P. The equality case is straightforward. □ 

Note that 7 (p) is not monotonic in p. Indeed, if one increases only the ith 
coordinate of p, then one increases the ith coordinate of 7 (p) and decreases all 
other coordinates of 7 (p). 

As usual, let = (8n, . . . , ^l) t , % = 1, . . . , L be the standard basis in R L . In 
what follows, we need the following result. 

Theorem 2.3 Let I £ [1,7] be an integer and a > 0. Denote [0,a]i x IR^ _1 
the set of all p = (pi, . . . ,pl) T £ R+ satisfying pi < a. Then the image of the set 
[0,a]i x IR^r 1 by the map 7 (2.2), is given by 

p(diag( 7 )(F + (l/a)ve z T )) < 1, < 7 . (2.11) 

Furthermore, p = {pi, ■ ■ ■ ,Pl) G satisfies the condition p\ = a if and only if 
7 = 7 (p) satisfies 

P (diag(7)(F+(l/a)veT)) = l. (2.12) 
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Proof. Suppose that 7 satisfies (2.11). We claim that 7 G T. Suppose first that 
7 > 0. Then diag(7)(F + tive / T ) < di&g(~f)(F + t 2 vej) for any t\ < t 2 . [11, Lemma 
2, §2, Ch. XIII] yields 

/ 9(diag( 7 )F) < p(diag(7)(F + iive^)) < p(diag( 7 )(F + t 2 vej)) < (2.13) 
p(diag(7)(F + (l/a)vej)) < 1 for < h < t 2 < l/o. 

Thus 7 € r. Combine the above argument with the arguments of the proof of Claim 
2.1 to deduce that 7 G T for 7 > 0. 

We now show that P("f)i < a. The continuity of P implies that it is enough 
to consider the case 7 > 0. Combine the Perron-Frobenius theorem with (2.13) to 
deduce 

< det(J - diag(7)(F + tvej)) for t G [0, a" 1 ). (2.14) 

We now expand the right-hand side of the above inequality. Let B = xy T G R LxL 
be a rank one matrix. Then B has L — l zero eigenvalues and one eigenvalue equal to 
y T x. Hence, 7— xy T has L — l eigenvalues equal to 1 and one eigenvalue is (1— y T x). 
Therefore, det(7 — xy T ) = 1 — y T x. Since 7 G T we get that (7 — diag(7)F) is 
invertible. Thus, for any t G K 

det(7 - diag(7)(F + tvej)) = 
det(7 - diag( 7 )F) det(7 - t((I - diag^T 1 )" 1 diag(7)v)e i r ) (2.15) 
det(7 - diag(7)F)(l - tej(l - diag(7)F)~ 1 diag(7)v). 

Combine (2.14) with the above identity to deduce 

1 > te^(7 - diag( 7 )F)- 1 diag( 7 )v = tP{j)i for t G [0, (T 1 ). (2.16) 

Letting t f a" 1 , we deduce that P{f)i < a. Hence, the set of 7 defined by (2.11) 
is a subset of 7QO, a]i x M^ _1 ). 

Let p G [0, a]i x R^T 1 and denote 7 = 7(p). We show that 7 satisfies (2.11). 
Claim 2.1 implies that p(diag(7)F) < 1. Since p = P(7) and pi < a we deduce 
(2.16). Use (2.15) to deduce (2.14). As p(diag(7)F) < 1, the inequality (2.14) 
implies that p(diag(7)F + tv T e^) < 1 for t G (O,^ 1 ). Hence, (2.11) holds. 

It is left to show the condition (2.12) holds if and only if P("f)i = a. Assume 
that p = (pi, . . . ,pi) T G R+, pi = a and let 7 = 7(p). We claim that equality 
holds in (2.11). Assume to the contrary that p(diag(7)(F + (l/a)ve / T )) < 1. Then, 
there exists f3 > 7 such that p(diag(/3)(7 1 + (l/a)\rej)) < 1. Since P is mono- 
tonic P((3)i > pi = a. On the other hand, since (3 satisfies (2.11), we deduce that 
P(/3)i < a. This contradiction yields (2.12). Similarly, if 7 > and (2.12) then 
P( 7 ), = a. □ 

Corollary 2.4 Let p = (pi, . . . ,Pl) T > be a given positive vector. Then 
7([0, p]), the image of the set [0, p] by the map 7 (2.2), is given by 

p (diag( 7 ) (F + (1/pOveT)) < 1, for I = 1, . . . ,L, and 7 G M+. (2.17) 
In particular, any 7 G satisfying the conditions (2.17) satisfies the inequalities 
7 < 7 = (71 1 • • • >7l) T ; w^ere 7/ = — , i = 1, . . . ,L. (2.18) 
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Proof. Theorem 2.3 yields that 7([0,p]) is given by (2.17). (2.5) yields 

/ x Pi ^ Pi ^ Pi f ^ m -l 
11 p = ((t? \ i V < — < — for p G 0,p . 

Note that equality holds for p = pi&i. □ 



3 The sum rate maximization problem 

We assume a singe-user decoder at each user, i.e., treating interference as additive 
Gaussian noise, and all users has perfect channel state information at the receiver. 
We also assume that fading occurs sufficiently slowly in the channel, i.e., flat-fading, 
so that the channel can be considered essentially fixed during transmission. As- 
suming that all users employ achievable random Gaussian codes, we can use the 
Shannon capacity formula for the maximum information rate on a link. In practice, 
Gaussian codes can be replaced by finite-order signal constellations such as the use 
of quadrature-amplitude modulation (QAM) or other practical (suboptimal) cod- 
ing schemes. Assuming a fixed bit error rate (BER) at the receiver, the Shannon 
capacity formula can be used to deduce the achievable data rate of the Zth user as 
[7]: 

log ( 1 + -^-) nats/symbol, (3.1) 



where T is the SNR gap to capacity, which is always greater than 1. In this paper, we 
absorb (1/r) into G\\ for all /, and write the achievable data rate as log(l + 7j(p)). 

Let w = (wi, . . . , wl) T > be a given probability vector, which is assigned 
by the network to each link (to reflect some long-term priority). The problem of 
maximizing sum rate in a Gaussian interference-limited channel can be stated as 
the following maximum problem 

L 

max yVlog(l + 7,(p)). (3.2) 

Lemma 3.1 Let w be a probability vector, and assume that p* = (jp\, . . . ,p£) T 
is a maximal solution to (3.2). Then p* = pi for some i. Furthermore if Wj = 
then p* = 0. 

Proof. Assume to the contrary that p* < p. Let 7* = 7(p*). Since P is 
continuous on T, there exists 7 6 T such that 7 > 7* such that P(7) < p. Clearly, 
3>w(7(p*)) < 3>w(7)- As 7 = 7(P(7)), we deduce that p* is not a maximal solution 
to (3.2), contrary to our assumptions. 

Suppose that Wj = 0. For p = (pi, . . . ,pi) T , let pj be obtained from p by 
replacing the jth coordinate in p by 0. Assume that pj > 0. Then 7«(p) < Ji(Pj) 
for i / j. Since Wj = 0, it follows that 3> w (7(p)) < $ w (7(pj)). □ 

Combine the above lemma with Theorem 2.3 and Corollary 2.4 to deduce an 
alternative formulation of (3.2). 
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Theorem 3.2 The maximum problem (3.2) is equivalent to the maximum prob- 
lem. 

maximize J2i w l l°s(l + ll) 

subject to p(diag(7)( J F + (l/p,)ve, T )) < 1 VZ G (L), (3.3) 
variables: 7/, VZ. 

7* is a maximal solution of the above problem if and only if P(7*) is a maximal 
solution p* 0/ i/ie problem (3.2). In particular, any maximal solution 7* satisfies 
the equality (2.17) for some integer I G [l,L]. 

We now give the following simple necessary conditions for a maximal solution 
p* of (3.2). We first need the following result, which is obtained by straightforward 
differentiation. 

Lemma 3.3 Denote by 

V 1 + 71 1 + 7l / 

the gradient of <I> W . Let 7(p) be defined as in (2.2). Then H(p) = [^]f = j =l! the 
Hessian matrix o/ 7 (p), is given by 

H(p) = diag((Fp + v)- x )(- diag( 7 (p))F + /). 

In particular, 

V p $ w ( 7 (p)) = H(p) T V$ w ( 7 (p)). 

Corollary 3.4 Let p* = {p\, . . . ,p* L ) T be a maximal solution to the problem 
(3.2). Divide the set (L) = {1, . . . ,L} to the following three disjoint sets S max , Si n , So-' 

S max = {»6 (L), p* = Pi}, S in = {i G (L), p* G (0,pi)}, S = {» G (L), p\ = 0}. 

XTien i/te following conditions hold. 

(H(p*) T V<I> w (7(p*))) l >0forie 

(H(p*) T W w (7(p*)))* = /or i G S in , (3.4) 





(H(p*) T V$ w ( 7 (p*)))i < for i G S 



Proof. Assume that p\ = pj. Then ^t$ w (7(p))(p*) > 0. Assume that < 

p* < ft. Then J-$ W (7(P))(P*) = 0. Assume that p* = 0. Then J-$w(t(p))(p*) < 
0. □ 



We now show that the maximum problem (3.3) can be restated as the maximum 
problem of convex function on a closed unbounded domain. For 7 = (71, ... , 7l) T > 
let 7 = log 7, i.e. 7 = e" 7 '. Recall that for a nonnegative irreducible matrix 
B G M^ xL lo g/ o(e x 5) is a convex function [13]. Furthermore, log(l + e*) is a strict 
convex function in t G R. Hence, the maximum problem (3.3) is equivalent to the 
problem 

maximize Y2i w l l°g(l + ) 

subject to logp(diag(e^)(F + (l/pj)ve, T )) < V/ G (L), (3.5) 
variables: 7 = (71, ... , 7 n ) T G R L . 
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The unboundedness of the convex set in (3.5) is due to the identity = e~°°. In 
view of Lemma 3.1, it is enough to consider the maximal problem (3.2) in the case 
where w > 0. 

Theorem 3.5 Let w > be a probability vector. Consider the maximum prob- 
lem (3.2). Then any point < p* < p satisfying the conditions (3.4) is a local 
maximum. 

Proof. Since w > 0, <E> w (e^) is a strict convex function in 7 G M L . Hence, the 
maximum of (3.5) is achieved exactly on the extreme points of the closed unbounded 
set specified in (3.5). (It may happen that some coordinate of the extreme point are 
—00.) Translating this observation to the maximal problem (3.2), we deduce the 
theorem. □ 

We now give simple lower and upper bounds on the value of (3.2). 

Lemma 3.6 Consider the maximal problem (3.2). Let B\ = (F + (l/pi)vej)) 
for I = 1, . . . , L. Denote R = max ie ( L ) p(Bi). Let 7 be defined by (2.18). Then 

$ w ((l/i?)l) < max $ w ( 7 (p) < $ w ( 7 ). 

pS[0,p] 

Proof. By Corollary 2.4, 7(p) < 7 for p € [0, p]. Hence, the upper bounds 
holds. Clearly, for 7 = (1/R)1, we have that p(diag(7)5j) < 1 for I G (L). Then, 
from Theorem 3.2, 3> w ((l/i?)l) yields the lower bound. Equality is achieved in the 
lower bound when p* = tx(Bi), where i = argmax; G ^ p(Bi), for some t > 0. □ 



4 Relaxations of the maximal problem 

In this section, we study several relaxed versions of (3.3), which will be used later 
to construct algorithms to solve (3.2). 

From the definition of j(p), we deduce that 



Define 



9u ~r. 9u 



F = [/y]fe =1) /„ = — l -, fij = ^ for j± I, I = 1, . . . , L. (4.2) 

' J gapi 911 



Lemma 4.1 Let < p < p. Assume that 7(p) and F are defined by (2.2) and 
(4-2), respectively. Then 

p > diag( 7 (p))Fp, (4.3) 

and 

p(diag(7(p)) J F) < 1. (4.4) 



10 



Proof. The assumption that < p\ < p\ implies that ^ > fuPi- Then using 
(4.1), the definition of F and the above observation implies (4.3). The inequality 
(4.4) is a consequence of Wielandt's characterization of the spectral radius of an 
irreducible matrix [11]. Indeed, if p > 0, i.e. all the coordinates of p are posi- 
tive, then 7(p) > 0. Hence, diag(7(p)))F is a positive matrix. Then Wielandt's 
characterizations claims 

p(diag( 7 (p))F) < max ^ilM^ < L 

l=l,...,L pi 

Observe next that if p\ = 0, then 7(p)z = 0. So if some of pi = 0, then p(diag(7(p))F) 
is the spectral radius of the maximal positive submatrix of diag(7(p))F. Apply to 
this positive submatrix Wielandt's characterization to deduce (4.4). □ 



Lemma 4.2 The maximum 

max $w(7) ( 4 - 5 ) 

-Y6K^,p(diag(-Y)F)<l 

is not less than the maximum of the problem (3.2). 

Proof. In view of (4.4), we see that the maximum in (4.5) is on a bigger set 
than the maximum in (3.2). □ 

By considering a change of variable in the constraint set of (4.5), we define the 
following set: 

D(F) = {7 € R L , logp(diag( e T')F) < 0}, (4.6) 
which is convex in R L . 

Corollary 4.3 We have p(di&g(j')F) = 1 in (4-5), where 7' solves (4-5) opti- 
mally. 

Lemma 4.4 // p* = p or p* is such that p\ = for some I and p* = pj for 
j ^ I, then 

/>(diag(7( P *))F) = 1. (4.7) 

Proof. The definition of F implies (4.7) for p* = p. Assume now that p\ = 
for some I. Then 7;(p*) = for some I. Then, the Ith row of diag( j j(p*))F is zero. 
Let Fi be the submatrix of F obtained by deleting the Ith row and column. Let 7 1 
be the vector obtained from 7 by deleting the Ith coordinate. Hence, the character- 
istic polynomial of diag(7)F, det(xJ — diag(7)F, is equal to xdet(xl — diag^jJ-Fi). 
Therefore, p(diag(7)F) = p(diag(7 1 )Fi). Continuing in this manner, we deduce the 
lemma. □ 



Corollary 4.3 and Lemma 4.4 imply that if the optimizer of (4.5) 7' satisfies 
P{l') ^ P> then P(-f') is also the global optimizer of (3.2). 

We now state a stronger relaxation problem than (4.5). Consider the transfor- 
mation 7(p, n) given in (2.2) as a function of p and the noise n > 0. Clearly, 
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7(p,ni) > 7(p,n2) for any ri2 > ni > and p > 0. Hence, it would be useful to 
consider the limiting case <5(p) := 7(p, 0): 

A(p) := ^ 9UPl , I = 1, ■ ■ ■ , L, /3(p) = (/?!(p), . . . , /3 L (p)) T , p > 0. (4.8) 
l^&l 9ljPj 

Hence, /3(p) can be viewed as communication in noiseless channels [1]. Another 
way to look at the noiseless channels is to allow channels with unlimited power p, 
i.e. pi = oo, i = 1, . . . , L. This remark is implied by the identity 

j(tp, n) = 7(p, in) for any t > 0. 

Denote by = {p S R+, l T p = 1} the set of probability vectors in R+. Since 
/3(p) = /3(tp) for any t > 0, we deduce 

sup $ w (/3(p)) = max * w (/3(p)). (4.9) 

P6K^\{0} peIli 

Clearly, the maximum in (4.9) is greater than the maximum in (3.2). 
Note that the definition of F given by (2.3) implies that 

diag(/3(p))Fp = p, and p(diag(/3(p))F) = 1 for any p > 0. (4.10) 

Hence, the maximum (4.9) is equal to 

max $ w (/3) = max $w(/3)- ( 4 -ll) 

£ ,p(diag(/3)F)< 1 /3eM^ ,p(diag( / 3)F)=l 



Since F < F and i 7 is positive, it follows that p(diag(7)i ? ) < p(diag(7)F) for any 
7 > [11]. Hence, the maximum in (4.11) is greater than the maximum in (4.5). 

Since it is generally difficult to determine precisely the spectral radius of a given 
matrix [22], the relaxed problems given by (4.5) and (4.11) enables one to find an 
upper bound to (3.2) quickly when L is fairly large. 

We next modify the function <& w (e' 7 ) appearing in (5.4). Consider the function 
log^ + e 1 ). If e x is relatively small, then log^ + e 21 ) is close to zero. If e x is assumed 
to be bigger than 1, then log(l + e x ) « \oge x = x. Hence, it is reasonable to 
approximate log(l + e x ) by x (Note that log(l + e x ) > x). 

Theorem 4.5 Let w = (w±, . . . ,wl) T be a positive probability vector. Then 

L L 

max _ V wi log 7/ = V w t log 7*, (4.12) 

T=(7l,-,7L) T >0,p(diag(-7)F)<l ^ 

where 7* = (7*,...,7^) T > is the unique vector satisfying the following condi- 
tions. Let A* = diag(7*)F. Then p(A*) = 1 and x(A*) o y(A*) = w. 

Proof. In the maximum problem (4.12), it is enough to consider 7 > such 
that p(diag('j) F) = 1. Then each of such 7 is of the form p o (Fp) _1 , for some 
< z (= x(diag(7)F) in Theorem A. 3. Use Theorem A. 3 and Corollary A. 6 to 
deduce the theorem. □ 

Combining Theorem A. 3 and Corollary A. 9 we deduce the following result. 
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Theorem 4.6 Let w = (wi, . . . ,wl) be a positive probability vector satisfying 
the conditions (A. 6). Let F be either equal to F or to F + (l/pi)vej for some 
I £ (L), respectively. Then 

L L 

max _ wi log 7/ = V] w t log 7* , (4.13) 

T=(7i,-,7L) T >0,p(diag( T )i ;, )<l JT[ JT[ 

where 7* = (7*, . . . , 7£) T > is a vector satisfying the following conditions. Let 
A* = diag(7*)F. Then p{A*) = 1 and x(A*) o y(A*) = w. 

Note that the choice F = F + (l/pi)vej applies, if we assume that the optimal 
solution p* = (p\, ■ ■ ■ ,p* L ) T of the maximal problem (3.2) satisfies p\ = p\ and 
p* < pj for j / /. 

The above last two theorems enable us to choose w for which we know the 
solution to the maximal problems (4.12) and (4.13). Namely, choose /3 1 , / 9 2 > 
such that A\ = diag^jjF, A2 = diag(/3 2 )-F have spectral radius one. Let Wj = 
x(Aj) o y(Ai) for i = 1,2. Then for wi, (4.12) has the unique maximal solution 
7* = (3i- For W2, (4.13) has a maximal solution 7* = j3 2 . Note that in view of 
Theorem A. 3, W2 does not have to satisfy the conditions (A. 6). 

5 Algorithms for sum rate maximization 

In this section, we outline three algorithms for finding and estimating the maximal 
sum rates. In view of Lemma 3.1, it is enough to consider the maximal problem 
(3.2) in the case where w > 0. Theorem 3.5 gives rise to the following algorithm, 
which is the gradient algorithm in the variable p in the compact polyhedron [0, p]. 

Algorithm 5.1 

1. Choose po € [0, p]: 

(a) Either at random; 

(b) or po = p. 

2. Given p k = (p ljk , . . . ,p L ,k) T G [0,p] for k > 0, compute a = (ai, . . . ,a L ) T = 
V p ( I ) w(7(Pfc))- If sl satisfies the conditions (3.4) for p* = p k , then p k is the 
output. Otherwise let b = (b\, . . . , bi) T be defined as follows. 

(a) bi = if Pi t k = and aj < 0; 

(b) bi = ifp it k = pi and ai > 0; 

(c) bi = ai if0<pi< pi. 

Then Pk+i = Pk +tkh, where tk > satisfies the conditions Pk+i € [0, p] and 
$ w (7(pfc + tfcbfc)) increases on the interval [0,^]. 

The problem with the gradient method, and its variations as a conjugate gradient 
method is that it is hard to choose the optimal value of tk in each step, e.g. [2]. We 
now use the reformulation of the maximal problem given by (3.5). Since w > 0, 
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the function <£ w (e~ 1 ') is strictly convex. Thus, the maximum is achieved only on the 
boundary of the convex set 



« = {7£K L , logp(diag(e^)(F+(l/p z )veT))<0, V/}. (5.1) 

If one wants to use numerical methods and software for finding the maximum 
value of convex functions on bounded closed convex sets , e.g., [17], then one needs 
to consider the maximization problem (3.5) with additional constraints: 

D({F},K) = tf€D({F}), 7 >-*!}• (5-2) 

for a suitable K 3> 1. Note that the above closed set is compact and convex. The 
following lemma gives the description of the set D({F},K). 

Lemma 5.2 Let p > be given and let R be defined as in Lemma 3.6. As- 
sume that K > log/2. Let p = P(e~ K l) = (e K I - F) _1 v. Then D{{F},K) C 

log7([P, P])- 

Proof. From the definition of K, we have that e K > R. Hence, p(e~ K B{) < 1 for 
l = l,...,L. Thus —Kl G D({F}). Let 7 = e~ K l. Assume that 7 € D({F},K). 
Then 7 > — Kl. Hence, 7 = > 7. Since p(diag(7)F) < 1, Claim 2.2 yields 
that p = P(j) > P{l) = p, where P is defined by (2.6). The inequality P(i) < p 
follows from Corollary 2.4. □ 



Thus, we can apply the numerical methods to find the maximum of the strictly 
convex function <J> w (e^) on the closed bounded set D({F}, K), e.g. [17]. In par- 
ticular, we can use the gradient method. It takes the given boundary point 7 fc to 
another boundary point of 7fc + i € D({F}, K), in the direction induced by the gra- 
dient of $ w (e' 7 ). However, the complicated boundary of D({F},K) will make any 
algorithm expensive. 

Furthermore, even though the constraint set in (3.3) can be transformed into a 
strict convex set, it is in general difficult to determine precisely the spectral radius 
of a given matrix [22]. To make the problem simpler and to enable fast algorithms, 
we approximate the convex set D({F},K) by a bigger polyhedral convex sets as 
follows. Choose a finite number of points £ 1; . . . , Cm 011 the boundary of D({F}), 
which preferably lie in D({F}, K). Let 

Hi(£), . . . ,Hjv(£),£ e R L be the N supporting hyperplanes of D({F}. (Note that 
we can have more than one supporting hyperplane at Ci, and at most L supporting 
hyperplanes.) So each £ G D({F},K) satisfies the inequality Hj(£) < for j = 
1,...,N. Let 7 be defined by (2.18). Define 

D(Ci, ...,Cm,K) = {^€ M L , -Kl<£< log 7, HjK) < for j = 1, ... , N}. 

(5.3) 

Hence, -D(Ci) ■ ■ ■ , Cm > K) ^ s a polytope which contains D({F}, K). Thus 

max $ w (e^) > (5.4) 
^eD(C 1: ...x M ,K) 

max <J> w (e^). (5.5) 

^eD{{F},K) 

Since $ w (e' 7 ) is strictly convex, the maximum in (5.4) is achieved only at the ex- 
treme points of Z}(Ci> • • • , Cm> K)- The maximal solution can be found using a 
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variant of a simplex algorithm [8]. More precisely, one starts at some extreme 
point of £ £ -D(Ci) • • • > Cmj K)- Replace the strictly convex function < I> w (e" ) ') by its 
first order Taylor expansion at Then we find another extreme point rj of 

D(Ci,... ,£m, k )i such tliat ^( J ?) > = ^w(e^). Then we replace $ w (e^) 

by its first order Taylor expansion ^ v at rj and continue the algorithm. Our sec- 
ond proposed algorithm for finding an optimal 7* that maximizes (5.4) is given as 
follows. 

Algorithm 5.3 

1. Choose an arbitrarily extreme point £ £ D(Cij • • • , Cm > ^0- 

2. Lei = ^w(e €fc ) + (w o (1 + e^)" 1 o e**) T (£ - Solve the linear 
program max^ (£) subject to £ € -D(Ci> • • • > Cm> ^0 usin^ £/ie simplex algo- 
rithm in [8] by finding an extreme point of D(C 1 , ■ • • , Cm> ^0> such that 

= *w(eH 

5. Compute = P(e^ fc +!). // p& € [0, p], compute a = (ai,...,0£,) T = 
V p ( l ) w(7(Pfc))- a satisfies the conditions (3.4) for p* = p^, then pk is 
the output. Otherwise, go to Step 2 using 1 J / £ fc+1 (£). 

As in the previous section, it would be useful to consider the following related 
maximal problem: 

max w T 7. (5.6) 

7GD(Ci,-,<M^) 

This problem given by (5.6) is a standard linear program, which can be solved 
in polynomial time by the classical ellipsoid algorithm [12]. Our third proposed 
algorithm for finding an optimal 7* that maximizes (5.6) is given as follows. 

Algorithm 5.4 

1. Solve the linear program max-y w T 7 subject to 7 G D (Ci, . . . , Cm> ^0 using the 
ellipsoid algorithm in [12]. 

2. Compute p = P(e" r ). If p G [0, p], then p is the output. Otherwise, project p 
onto [0, p]. 

We note that 7 € D (Ci, . . . ,C M ,K) in Algorithm 5.4 can be replaced by the 
set of supporting hyperplane D(F,K) = {7 £ /9(diag(e' T )-F) < 1, 7 > —XI} 
(extending (4.6)) or, if L > 3 and w satisfies the conditions (A. 6), D(F, K) = {76 
p(diag(e' 7 )F) < 1, 7 > —Kl} based on the relaxed maximal problems in Section 
4. Then Theorem 4.5 and 4.6 quantify the closed-form solution 7 computed by 
Algorithm 5.4. 

We conclude this section by showing how to compute the supporting hyperplanes 
Hj, j = 1, . . . , N, which define D(£ ± , . . . , Q M , K). To do that, we give a characteri- 
zation of supporting hyperlanes of D({F}) at a boundary point C € dD({F}). 

Theorem 5.5 Let p = (pi, . . .pi) T > be given. Consider the convex set 
(5.1). Let C be a boundary point of dD({F}). Then C = log7(p), where < 
P = (pi-, ■ ■ ■ iPl) T < P- The set B := {I € (L), = p{\ is nonempty. For each 
B\ = (F + (l/pi)vej)) let Hj(£) 6e defined as in Theorem A. 2, where B = Bi and 
r) = C- T/ien < 0, /or I £ B, are the supporting hyperplanes of D({F}) at C- 
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Proof. Let p = P(e^). Theorem 2.3 implies the set B is nonempty. Further- 
more, p(e**Bi) = 1 if and only if pi = p\. Hence, Q lies exactly at the intersection of 
the hypersurfaces log p(e^Bi) =0,1 £ B. Theorem A. 2 implies that the supporting 
hyperplanes of D({F}) at C are H/(£) < for / € B. □ 

We now show how to choose the boundary points £ l5 . . . , Q M G dD({F}) and 
to compute the supporting hyperplanes of D({F}) at each Q. Let p = P(e~ K 1) = 
(pi, . . . ,pl) T be defined as in Lemma 5.2. Choose M{ > 2 equidistant points in each 
interval \p.,pi\. 

jip. + (Mi — jijpi 

Pji,i = — for ji = l,...,Mi, and i = 1, . . . , L. (5.7) 

Let 

v = {P,: .//. = (Pji,i. • • • >PjL,L) T , min(pi - Pjl>1 , ...,p L - p jL , L ) = 0}. 

That is, Pj 1: ...,j L € P if and only p^,...,^ ^ p. Then 

{Ci,---,CM} = iog7(n 

The supporting hyperplanes of D({F}) at each are given by Theorem 5.5. 

6 Extension to DSL channels 

In this section, we extend our previous results in Section 3 to the multiuser DSL 
channel, where a common spectrum is divided into K frequency tones denoted by 
(K). For simplicity, we assume the standard synchronous discrete multi-tone (DMT) 
modulation over DSL, where orthogonality among subchannels of the intended sig- 
nal and the subchannels of the interference signal in different frequency tones is 
maintained. Thus, transmissions can be modeled independently on each tone. The 
achievable rate at tone k can be modeled as 

log f 1 + v 1 P L k ) ■ (6-1) 

The total data rate for each user is then obtained by adding its transmitted bits 
over all the K tones. The total power budget of the /th user is constrained (across 
all K tones) by 

Erf < ft- (6-2) 
k=i 

It is easy to see that our previous model in Section 2 is a special case of the DSL 
model assuming standard synchronous discrete multi-tone (DMT) modulation when 
K = 1. 

For brevity of notations, we define p <G MS K xi ) as a vector that stacks the K x L 
power allocation lined-up according to tones of all users. For example, if L = 2 
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and K = 2, then p = (pi,Pi,P2>Pi) • Similarly, we define 

7 G R(^xL) ag the S | R 

allocation, the following matrix 

( 0, if i = j for each i, j € (L), for each k G (if) 

yyJij=i> ™e s if ^ j for each ^- G ^ for each fc G ^ 

(6.3) 

and 

(12 ft' 1 2 K - N T 

2n 5n 5n 522 522 9llJ 
Note that F is a block diagonal matrix under the assumption of synchronous 
transmission. More generally, asynchronous transmission can result in the Ith. user 
at tone k having interference from the power allocation at neighboring tones of tone 
k. In this case, F needs not be block diagonal. 

As in the previous, the Zth user is assigned a weight wi as an indicator of long- 
term priority. The problem of maximizing sum rate in a Gaussian interference- 
limited channel with K frequency tones can be stated as the following maximum 
problem 

L K 

max V wi V log(l + 7 f (p)), (6.5) 

Ef=iP?<p«vi^ ti 

where 7^ = gf t pf / '(Ylj^i 9ijPj + n i)- ^ n or der to be consistent with the previous 
results, where we assume w to be a probability vector, it is more convenient to 
consider the following equivalent problem: 



L K 

max VV^ fe log(l+7f(p))- (6.6) 



where wi k = (wi/K) for all / <G (L),k € (K), and w € R( KxL ^ is a probability 
vector. 

As in Section 3, it is instrumental to consider an alternative formulation of (6.6). 

Theorem 6.1 The maximum problem (6.6) is equivalent to the maximum prob- 
lem. 

maximize YjI Hk ™lk lo g(! + li) 

subject to /5 (diag( 7 )(F + (l/p i )EfeVeT x0 ))<l V/G(L), (6.7) 
variables: 7^, V/, V/c. 

7* is a maximal solution of the above problem if and only if -P(7*) is a maximal 
solution p* of the problem (6.6). In particular, any maximal solution 7* satisfies 
the equality 

p ^diag( 7 ) (f+ (l/ P OE ve 5xo)) = 1 ( 6 - 8 ) 

for some integer I € [1, L\. 

Using Theorem 6.1, it is straightforward to extend our three algorithms in Sec- 
tion 5 for desigining optimal spectrum management schemes in DSL channels. We 
note however that our techniques are general enough to model asynchronous trans- 
mission, where the power allocations in neighboring tones interfere with one another. 
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This results in intercarrier interference (ICI) and tone coupling in (6.1); as mentioned 
earlier, ICI effects are captured by F in (6.3), where the optimal solution to (6.6) is 
then characterized by the spectrum of different interference matrices. 

In comparison to existing algorithms used for solving (6.5), we highlight briefly 
the advantages of our algorithms. Assuming synchronous transmission, the Optimal 
Spectrum Balancing (OSB) method is considered the state-of-the-art algorithm for 
computing the spectrum allocation [16]. The key idea of OSB is to leverage Lagrange 
dual decomposition to decouple the many uncoupled terms in (6.5) into L lower 
dimensional subproblems (whose cost function on each tone is still nonconvex). 
However, this decoupling technique in the power domain cannot be done in the 
case of asynchronous transmission. On the other hand, in both synchronous and 
asynchronous transmission, Algorithm 5.3 and 5.4 with the choice of supporting 
hyperplanes using the Friedland-Karlin inequalities permit decoupling of both users 
and frequency tones in the SIR domain. 

7 Conclusion 

We looked at optimizing power control to maximize the weighted sum rate in a Gaus- 
sian interference-limited channel that models multiuser communication in a CDMA 
wireless network or DSL cable binder. Using tools from irreducible nonnegative ma- 
trix theory, in particular the Perron-Frobenius Theorem and the Friedland-Karlin 
inequalities, we provided insights into the structural property of optimal power allo- 
cation strategies that maximize sum rates. We showed that sum rate maximization 
can be restated as maximizing a convex function on a closed convex set. We pro- 
posed three algorithms to find the exact and approximate values of the optimal sum 
rates. In particular, our algorithms exploited the eigenspace of specially crafted 
nonnegative interference matrices, which, with the use of standard optimization al- 
gorithms, can provide useful upper bounds and feasible solutions to the nonconvex 
problem. 

A Appendix: Friedland-Karlin results 

In this section, we recall some results from [10] and state the extensions of these re- 
sults, and then illustrate their applications in this paper. We first state the following 
extension of [10, Theorem 3.1]: 

Theorem A.l Let A £ M^ xL be an irreducible matrix. Assume that x(A) = 
(xi(A), . . . , xl{A)) t , y(A) = (yi(A), . . . ,ul(A)) t > are left and right Perron- 
Frobenius eigenvectors of A, normalized such that x(A)oy(A) is a probability vector 
Suppose 7 is a nonnegative vector. Then 



i 

If 7 is a positive vector then equality holds if and only if all 72 are equal. Further- 
more, for any positive vector z = (z±, . . . , zl) t , the following inequality holds: 



(x(A)oy(A)) ! 



< p(diag(7)A). 



(A.l) 




(A.2) 
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If A is an irreducible matrix with positive diagonal elements, then equality holds in 
(A. 2) if and only if z = tx(A) for some positive t. 

Proof. Theorem 3.1 in [10] makes the following assumptions. First, in the 
inequality (A.l), it assumes that 7 > 0. Second, in (A. 2), it assumes that p(A) = 1. 
Third, the equality case in (A. 2) for z > is stated for a positive matrix A. We 
now show how to deduce the stronger version of Theorem 3.1 claimed here. 

First, by using the continuity argument, we deduce the validity of (A.l) for any 
7 > 0. Second, by replacing A by tA, where t > 0, we deduce that it is enough to 
show (A. 2) in the case p(A) = 1. 

Third, to deduce the equality case in (A. 2) for z > 0, we need to examine the 
proof of Lemma 3.2 in [10]. The proof of the Lemma 3.2 applies if the following con- 
dition holds. For any sequence of probability vectors Zj = {zi^, . . . , zl,i) t ,i = 1, . . ., 
which converges to a probability vector £ = (£1, . . . , (l) T , where £ has at least one 

zero coordinate, the function Yli=i ( ~ip ) tends to 00 on the sequence 

Zj, i = 1, Assume that A = {I £ (L), Q = 0}. Note that the complement of A in 

(L), denoted by A c is nonempty. 

Since A = [a^] has positive diagonal entries, it follows that > an > for 

each I G (L). Since A is irreducible, we deduce that there exist I £ A and m € A c 
such that ai m > 0. Hence, lim^oo i^h. = This shows that the unboundedness 

z l,i 

condition holds. 

□ 



The following result gives an interpretation of the inequality (A.l) in terms of 
the supporting hyperplane of the convex function logp(e^-B), where B £ IR^ xL is 
irreducible and ^6l L . 

Theorem A. 2 Let B € R^ xL be an irreducible matrix. Let r] = (771, ... , t/l) T € 
R L satisfy the condition p{e v B) = 1. Denote A = e^B and assume that 
x(A) = (xi(A),...,x L (^)) T ,y(A) = ( yi (A),...,y L (A)) T > are left and right 
Perron- Frobenius eigenvectors of A, normalized such that x(^4)oy( A) is a probability 
vector. Let 

L 

n(£) = J2xi(A)yi(A)((; l -r ll ). (A.3) 
1=1 

Then H(£) < is the unique supporting hyperplane to the convex set log p(e*B) < 
at £ = r/. 

Proof. Let £ G R L . Then e^B = e* -TJ A Theorem A.l implies that H(£) < 
log p(e*B). Thus, H(£) < if log p(e*B) < 0. Clearly, R(rj) = 0. Hence, H(£) < 
is a supporting hyperplane of the convex set log p(e*B) < 0. Since the function 
log p(e^B) is a smooth function of £, it follows that H(£) < is unique. □ 

We now give an application of (A. 2) in Theorem A.l. 

Theorem A.3 Let B G M^ xL be an irreducible matrix. Let ij = (rji, ... , t/l) t € 
R L . Let w = x(diag(e T? )S) o y(diag(e 1? )i?) = (w\, . . . ,wl) T be a probability vector. 
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Then for any positive vector z = (z±, ... , zl) 
l 

X^ lo S7oV - -logp(diag(e T7 )S) + X^/^- ( A - 4 ) 
l=i ^ Z ' 1 1=1 

If B has a positive diagonal, then equality holds if and only if z = tx(diag(e T? ) B) 
for some t > 0. 

Proof. Let A = diag(e v )B. Then 

U>Z log = > ioj log > W/Tft. 

1=1 ^ ti Zl ti 

Use (A. 2) to deduce (A. 4). The equality case follows from the equality case in (A. 2). 

□ 

We now study the following inverse problem. 

Problem A.4 Let B G M+ xL ,w G be given irreducible matrix and pos- 
itive probability vector, respectively. When does there exist rj G M. L such that 
x(diag(e T? )i?) o y(diag(e T ')i?) = w? If such rj exists, when is it unique up to an 
addition tl ? 

To solve the inverse problem, we recall Theorem 3.2 in [10]. 

Theorem A. 5 Let A G M^ xL ,u, v G be given, where A is irreducible with 
positive diagonal elements and u, v are positive. Then there exists D±,D2 G R+ xi 
such that 

L>iAD 2 u = u, v T L>iAD 2 = v T , D x = diag(f), D 2 = diag(g) and f,g > 0. (A.5) 

The pair {D\,D2) is unique to the change (tDi,t -1 D2) for any t > 0. There exist 
T) G M L stic/i t/iai x(diag(e T? B)) o y(diag(e T? i?)) = w. Furthermore, rj is unique up 
to an addition tl. 

Corollary A.6 Let B G M+ xL , w G be given irreducible matrix with positive 
diagonal elements and positive probability vector, respectively. Then there exists 
rj G M L such that x(diag(e T? )S) o y(di&g(e v )B) = w. Furthermore, rj is unique up 
to an addition oftl. 

Proof. Let u = 1, v = w. Then there exists D\,Di two diagonal matrices with 
positive diagonal entries such that D\BDq\ = 1, w T D1BD2 = w T . Consider the 
matrix D2D1B = D2{D\BD2)D2 ■ It is straightforward to see that x(L>2-Di-B) o 
y(D2DiB) = w. Hence, rj is the unique solution of diag(e ?7 ) = D2D1. 

Assume that £ G M L satisfies x(diag(e^)-B) o y(diag(e^)i?) = w. By considering 
£ = £ + tl, we may assume that p(di&g(e^)B) = 1. Let D4 = diag(x(diag(e^)i?). 
Then (D^ 1 diag(e<)BD 4 )l = 1. Let D 3 = 1 diag(e^). Hence, y(D 3 BD 4 ) = w. 
In view of Theorem A.5, diag(e^) = D4D3 = D2D1 = diag(e T? ). □ 
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Unfortunately, the matrix F defined in (2.3) have zero diagonal entries and 
positive off-diagonal entries. For L = 2, it is easy to show that x(i ? )oy(F) = |) T . 
In particular, for L = 2, Problem A. 4 is not solvable for w ^ (i, i) T . Similarly, 
given positive u, v G M 2 such that u o v ^ t(l, 1) for any positive t, (A. 5) does not 
hold for A = F. For L > 3, the situation is different. 

Theorem A.7 Let L > 3, A G M+ xi ,u = (m, . . . ,u L ) T , v = (vi, . . . ,v L ) T € 
be given, where A is a matrix with zero diagonal entries and positive off-diagonal 

elements, and u, v are positive. Assume that w = u o v is a probability vector 

satisfying the condition 

^ Wj > wi for all I G (L). (A.6) 

on all jj^l 

Then there exists D\,D 2 G M+ xi such that (A. 5) holds. 

Proof. Let A4 = A + (l/i)I,i = 1, . . ., where / is the L x L identity matrix. 
Theorem A. 5 implies 

Bi = D hi AiD 2 ,i, D hi = diag(fj), D 2 ,i = diag(gj), B { u = u, v T B { = v T , 

fi = (fi,i,-- -,fL,i) T , Wj = (g lti , . . .,g L ,i) T , Si = max/j-j = max g j4 , i = 1, . . . . 

je{L) je(L) 

Note that each entry of Bi is bounded by " J . By passing to the subsequence 
Bi k ,fi k ,gi k , 1 < ii < *2 < • • •, we can assume that the first subsequence converges 
to B, and the last two subsequences converge in generalized sense: 

lim B ik =B = [b jt ] G M^ xL , lim f ifc = f = (f u . . . , f L ) T , lim gl = g = ( 5l , . . . , g L 

k— >oo fc— >oo fe— >oo 

G [0, 00], j = 1, . . . , L, lim s ik = s = max /_,- = max gj G [0, 

k^oo je(L) je{L) 

Note that 

Bu = u, v T B = v T . (A.7) 

Assume first that s < 00. Then = diag(f)^4 diag(g). In view of (A.7), fog > 0. 
This proves the theorem in this case. 
Assume now that s = 00. Let 

^00 = {j € (L), /j = 00}, J+ = {j G (L), fj G (0, 00), } Jo = {j G (L), = 0}, 
£oc = {j G (£), 9j = 00}, = {j G (L), G (0, 00), } Go = {j G (L), gj = 0}. 

Since off-diagonal entries of A are positive, and B G M+ xL it follows that Too = 
Goo = {0 for some / G (L). Furthermore, = G+ = 0- So = Qq = {L)\{1}. 
Assume first that / = 1. Then the principal submatrix [bji]^ =l=2 is zero. (A.7) yields 
that 

L 

bji = — , b\j = — for j = 2, . . . , L, bnuiv\ + > UjVj = uiv\. 
ui vi 

3=2 

Since bn > 0, the above last equality contradicts the condition (A.6) for I = 1. 
Similar argument implies the impossibility of J 7 ^ = Goo = {1} for any I > 2. Hence, 
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s < oo and we conclude the theorem. 



□ 



We do not know whether, under the conditions of Theorem A. 7, the diagonal 
matrices (Di,!^) are unique up to the transformation (tDi,t~ 1 D2). We now gen- 
eralize the above theorem. 

Theorem A. 8 Let 

L>2, yl=[a4 =l=1 elf, < u = (u u . . . , u L ) T , v = (vi, . . . , v L ) T G 

be given. Assume that A has positive off- diagonal elements, and w = u o v is a 
probability vector satisfying the condition 

Wj > wi (A.8) 

on all jy^l 

for each I such that an = 0. Then there exists D\, D2 G M+ xL such that (A. 5) holds. 

Proof. Assume first that L > 3. In view of Theorems A. 5 and A. 7, it is enough 
to assume that A has positive and zero diagonal entries. Apply the proof of Theorem 
A. 7 and the following observation. If = Goo = {1} then an = 0. 

Assume now that L = 2. Note that if A has a zero diagonal then the condition 
(A. 6) can not hold. Assume now that A has at least one positive diagonal element. 
Then the above arguments for L > 3 apply. □ 

Corollary A. 9 Let B = [bji]^ =l=1 G M^ xL ,w G be given matrix with pos- 
itive off-diagonal elements and a positive probability vector, respectively. Assume 
that L > 2 and w satisfies the conditions (A.8) for each I such that bu = 0. Then 
there exists rj Gl L such that x(diag(e 7J )S) o y(diag(e ?7 )i?) = w. 
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